AMENDMENT UNDER 37 C.F.R. §1.111 Attorney Docket No.: CQ10210 

U.S. Appln.No.: 10/729,915 

AMENDMENTS TO THE CLAIMS 

This listing of claims will replace all prior versions and listings of claims in the 

application: 

LISTING OF CLAIMS: 

1 . (currently amended) A method for organizing a plurality of data files using meta- 
data wherein at least one meta-data element is associated with each data file, the method 
comprising: 

extractin g, for at l e ast some selected data fil e s, the at least one meta-data element 
associated with a r e sp e ctive selected each data file; 

organizing the extracted meta-data elements into a single ordered set wherein the set is 
ordered consecutively based on values for one or mor e of the extracted meta-data elements; 

calculating pair-wise differences between values of each of the extracted meta-data 
element s for all possible pairs of data files ; 

inputting at least one value of a clustering sensitivity parameter, said clustering 
sensitivity parameter defining granularity of the clustering, and multiplying each pair-wise 
difference by the clustering sensitivity parameter to obtain a plurality of similarity values for 
determining clustering based on the similarity values and at a granularity defined by the 
clustering sensitivity parameter; and 

dividing the selected data files into groups based on the similarity values. 
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2. canceled. 

3. (previously presented): The method of claim 1, wherein determining the 
similarity value of a one pair comprises evaluating the similarity value as: 

^(U) = exp^-t^j, 

where: 

the one pair consists of the data file and f h data file; 

S K (i, j) is the similarity value for the f h data file and the / h data file; 

K is the clustering sensitivity parameter value; 

ti is the value of a selected extracted meta-data element associated with the z' th 
data file; and 

tj is the value of a selected extracted meta-data element associated with the f 1 
data file. 

4. (previously presented): The method of claim 1, wherein deterrnining the 
similarity value of a one pair comprises determining the similarity value of the one pair as: 
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S K (i,j) = cxd 




n KiK-i 



-1 



where: 

the one pair consists of the I th data file and 7 th data file; 

S K (z , j) is the similarity value for the /* data file and the / h data file; 

K is the clustering sensitivity parameter value; and 

v, and Vj are actual vector values determined from the 1 th and the j* data files. 

5. (previously presented): The method of claim 1, further comprising determining, 
for each of the selected data files, at least one novelty value based on at least one similarity value 
of the selected data file and a nearby data file. 

6. (currently amended): The method of claim 5, wherein determining at least one 
novelty value comprises determining at least one novelty value as: 



[[v A (5)=X5 A .(5 + l,5 + «)g(l,«)]] 



v K (s) = J S K (s + 1, s + n)g(l, n) 
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where: 

v K (s) is the novelty value; and 

g is a Gaussian tapered (21+1) x (2n+l) checkerboard kernel. 

7. (previously presented): The method of claim 5, further comprising determining at 
least one boundary location between the selected data files based on the at least one novelty 
values determined for each of the selected data files. 

8. (previously presented): The method of claim 7, further comprising determining, 
for at least the one determined boundary locations, a confidence value for that boundary location. 

9. (previously presented): The method of claim 8, wherein determining a confidence 
value for a boundary location comprises determining the confidence value as: 




where: 

C(B K ) is the confidence value for the B K th boundary; 
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S K (i, j) is the similarity value for the i* data file and the f 1 data file; 

b is the index value of detected boundary at a particular value for the input clustering 
sensitivity parameter K level. 

10. (previously presented): The method of claim 8, further comprising determining, 
for at least the one determined boundary locations, at least one clustering sensitivity parameter 
value that maximizes the confidence value. 

1 1 . (currently amended): A method for organizing a plurality of data files stored in a 
digital memory using meta-data, wherein at least one meta-data element is at least associated 
with a corresponding one of the plurality of data files, the method comprising: 

extracting from the memory meta-data elements of the plurality of data files; 

organizing the extracted meta-data elements into a single ordered set wherein the set is 
ordered consecutively based on values for one or mor e of the extracted meta-data elements! 

calculating pair-wise difference between values of each of the extracted meta-data 
element s for all possible pairs of data files ; 

obtaining a desired value of a clustering sensitivity parameter for analyzing the meta- 
data; 
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multiplying each pair-wise difference by the clustering sensitivity parameter to obtain a 
plurality of similarity values having granularity defined by the clustering sensitivity parameter; 

determining a structure within the meta-data elements by comparing, for at least a subset 
of the plurality of data files, the similarity values; and [[ , ]] 

storing the structure of the data files in a memory. 

12. (previously presented): The method of claim 1 1 , further comprising clustering the 
data files into groups using the determined structure of the meta-data elements. 

13. (original): The method of claim 12, further comprising determining boundaries 
from the determined clusters of data files, wherein the boundaries are located between the 
determined clusters of data files. 

14. (previously presented): The method of claim 13, further comprising: 

determining a similarity value by comparing at least some of the meta-data elements 
associated with data files in a first selected cluster of data files to at least some other meta-data 
elements associated with data files of in the first selected cluster of data files; and 
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determining a dissimilarity value by comparing at least some of the meta-data elements 
associated with data files in a second selected cluster of data files to at least some of the meta- 
data elements associated with data files in a third selected cluster of data files, wherein the 
second selected cluster of data files and the third selected cluster of data files are different 
clusters of data files. 

15. (original): The method of claim 14, further comprising: 

determining a value corresponding to a desired grouping of the clusters of data files based 
on the differences of the similarity values and the dissimilarity values. 

16. (currently amended): A storage medium storing a set of program instructions 
executable on a data processing device and usable to organize a plurality of data files by using 
meta-data wherein at least one meta-data element is associated with each data file, the program 
comprising: 

instructions for extractin g, for at l e ast some of th e data files, the at least one meta-data 
element associated with each of the som e data files; 

instructions for organizing the extracted meta-data elements into a single ordered set 
wherein the set is ordered consecutively based on at least one of: chronological, alphabetical, 
numerical, or geographical ordering; 
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instructions for calculating pair-wise difference between values of each of the extracted 
meta-data elements for all possible pairs of data files ; 

instructions for inputting at least one clustering sensitivity parameter value, said 
clustering sensitivity parameter defining granularity of the clustering; 

instructions for multiplying each pair-wise difference by the clustering sensitivity 
parameter to obtain a plurality of similarity values having granularity defined by the clustering 
sensitivity parameter; and 

instructions for dividing the data files into groups based on the similarity values of the 
extracted meta-data elements. 

17. canceled. 

18. (previously presented): The storage medium of claim 16, further comprising 
instructions for determining, for each of the at least some data files, at least one novelty value for 
that data file based on at least one similarity value of that data file and a nearby data file. 
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19. (previously presented): The storage medium of claim 16, wherein instructions for 
determining the similarity value of the one pair comprises instructions for evaluating the 
similarity value as: 

^(z,y) = exp^-t^lj, 

where: 

the one pair consists of the z -th data file and / h data file; 

S K is the similarity value for the z' th data file and the / h data file; 

K is the clustering sensitivity parameter value; 

U is the value of a selected extracted meta-data element associated with the z th 
data file; and 

tj is the value of a selected extracted meta-data element associated with the 7 th 
data file. 

20. (previously presented): The storage medium of claim 16, wherein instructions for 
determining the similarity value of the one pair comprises determining the similarity value of the 
one pair as: 
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where: 

the one pair consists of the z -th data file and / h data file; 

S K is the similarity value for the z* data file and the f h data file; 

K is the clustering sensitivity parameter value; and 

V/ and vj are actual vector values determined from the i th and the data files. 

21. (previously presented): The storage medium of claim 18, further comprising 
instructions for determining at least one boundary location between the some data files based on 
the at least one novelty values determined for the each of the at least some data files. 

22. (currently amended): The storage medium of claim 18, wherein instructions for 
determining at least one novelty value comprises instructions for determining the at least one 
novelty value as: 

[K(*) = Z^ + l,s + «)g(l,n)]] 

i,« 

where: 
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vk(s) is the novelty value; and 

g is the Gaussian tapered (21+1) x (2n+l) checkerboard kernel. 

23. (previously presented): The storage medium of claim 21, further comprising 
instructions for determining, for at least the one determined boundary locations, a confidence 
value for that boundary location. 

24. (original): The storage medium of claim 23, wherein instructions for determining 
at least one confidence value comprises instructions for determining each of such confidence 
value as: 

W ( b M~ b \) i,j=bX 1=1 \Kl-W)V>l + 2- b M) i-hj-bM 

where: 

C(B K ) is the confidence value for the B K th boundary; 

S K is the similarity value for the i th data file and the j th data file; 

b is the detected boundary at a level. 
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25. (previously presented): The storage medium of claim 23, further comprising 
instructions for determining, for at least the one determined boundary location, at least one 
clustering sensitivity parameter value that maximizes the confidence value. 

26. (currently amended): A computerized system usable to organize a plurality of 
data files using meta data having at least one meta data element that is at least associated with a 
corresponding one of the data files, comprising: 

a memory for storing the data files; 

an input link for receiving the data files; 

a meta-data extracting circuit [[, routine, or application ]] that extracts, for at least som e 
of the data files, the_at least one meta-data element associated with that data file; 

a meta-data organizing circuit [[, routine or application ]] that organizes the extracted 
meta-data elements in a desired order based on values for the extracted meta-data elements; 

a similarity value determining circuit [[, routine or application ]] that: 

selects at least one clustering sensitivity parameter value, said clustering 
sensitivity parameter defining granularity of clustering [[;]] , and 

calculates a similarity value for at l e ast two of all possible pairs of data files in the 
plurality of data files by performing a pair-wise comparison of the extracted meta-data 
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element s, corresponding to all possible pairs of data files, and multiplying the pair-wise 
comparison by the clustering sensitivity parameter value; 

a novelty value determining circuit [[, routine or application ]] that determines at least 
one novelty value for that data file based on the at least one similarity value for that data file and 
for a number of nearby data files; 

a data dividing determining circuit [[, routine or application ]] that divides at least some 
of the data files into groups based on the extracted meta-data elements and the input clustering 
sensitivity parameter value by determining at least one boundary location between ones of the 
plurality of data files based on the at least one novelty value determined for at least some of the 
data files; and 

a confidence value determining circuit [[, routine or application ]] that determines, for at 
least some of the determined boundary locations, a confidence value for that boundary location, 
wherein the data dividing circuit [[, routine, or application ]] further determines, for at least some 
of the determined boundary locations, the at least one clustering sensitivity parameter value that 
maximizes the confidence value; and [[ , ]] 

an output link for outputting the meta-data elements in a desired grouping order^ 

wherein one or more processors control operation of the circuits . 

27. (previously presented): The method of claim 1, wherein determining the 
similarity value of the one pair comprises a term depending on a scalar magnitude of a difference 
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between t t and tj relative to K, where K is the clustering sensitivity parameter value, i references 
a first data file of the one pair, j references a second file of the one pair, is the value of a 
selected extracted meta-data element associated with the first data file, and tj is the value of a 
selected extracted meta-data element associated with the second data file. 

28. (previously presented): The method of claim 27 wherein the depending 
comprises an exponentially decreasing function of the scalar magnitude of the difference 
between U and tj relative to K. 

29. (previously presented): The method of claim 1, wherein detemining the 
similarity value of the at least one pair of the selected data files comprises a term depending on 
an inner product of v,- and vj relative to K, where K is the clustering sensitivity parameter value, v ; 
is an actual vector value determined from the 1 th data file, and y,- is an actual vector value 
determined from the j th data file. 
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